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ABSTRACT 

Using a homogenous sample of 38,208 quasars with a sky coverage of - 4000 deg2 drawn from the SDSS 
Data Release Five quasar catalog, we study the dependence of quasar clustering on luminosity, virial black 
hole mass, quasar color, and radio loudness. At z < 2.5, quasar clustering depends weakly on luminosity and 
virial black hole mass, with typical uncertainty levels ^ 10% for the measured correlation lengths. These 
weak dependences are consistent with models in which substantial scatter between quasar luminosity, virial 
black hole mass and the host dark matter halo mass has diluted any clustering difference, where halo mass is 
assumed to be the relevant quantity that best correlates with clustering strength. However, the most luminous 
and most massive quasars are more strongly clustered (at the ^ 2a level) than the remainder of the sample, 
which we attribute to the rapid increase of the bias factor at the high-mass end of host halos. We do not observe 
a strong dependence of clustering strength on quasar colors within our sample. On the other hand, radio-loud 
quasars are more strongly clustered than are radio-quiet quasars matched in redshift and optical luminosity (or 
virial black hole mass), consistent with local observations of radio galaxies and radio-loud type 2 AGN. Thus 
radio-loud quasars reside in more massive and denser environments in the biased halo clustering picture. Using 
the Sheth et al. (2001) formula for the linear halo bias, the estimated host halo mass for radio-loud quasars is 

lO'-' /!~'M0, compared to ~ 2 x 10'^ h~^MQ for radio-quiet quasar hosts at z 1.5. 
Subject headings: black hole physics - galaxies: active - cosmology: observations - large-scale structure of 
universe - quasars: general - surveys 



1. INTRODUCTION 

Quasars are luminous, accreting supermassive black holes 
(SMBHs) that are thought to reside at the center of almost 
every massive galaxy (e.g., Salpeter 1964; Zel'dovich & 
Novikov 1964; Lynden-Bell 1969; Kormendy & Richstone 
1995; Richstone et al. 1998). Despite their high luminosi- 
ties and perhaps different BH fuelling mechanism, quasars 
are no different from their low-luminosity counterparts, active 
galactic nuclei (AGNs): they both represent stages of build- 
ing up SMBHs, and they interact with the host galaxies in 
a self-regulated way (e.g.. Silk & Rees 1998; Magorrian et 
al. 1998; Ferrarese & Merritt 2000; Gebhardt et al. 2000; 
Kauffmann & Haehnelt 2000; Wyithe & Loeb 2002, 2003; 
King 2003; Di Matteo et al. 2005; Hopkins et al. 2006; Cro- 
ton et al. 2006). Therefore understanding the properties of 
quasars/ AGNs, such as their abundance (luminosity function), 
spatial distributions (clustering), physical properties (spectral 
energy distributions; black hole masses, etc.) and the depen- 
dence of these properties on redshift, is crucial to understand 
galaxy formation and SMBH growth within the standard hier- 
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archical structure formation framework. 

In this work we focus on the spatial clustering of quasars. 
Similar to galaxies, quasars can be used to trace the large- 
scale distribution of the underlying dark matter. Because of 
their high luminosity, quasars can be seen at redshift < z < 6 
in a single survey, allowing them to be used to map the dark 
matter (DM) distribution over this entire redshift range. How- 
ever, owing to the enormously large spatial volume probed 
and the fact that the quasar number density is much smaller 
than that of galaxies, the analysis of quasar clustering has only 
become practical in recent years, namely after large-scale sur- 
veys such as the 2dF quasi-stellar object (QSO) redshift sur- 
vey (2QZ, Croom et al. 2004) and the Sloan Digital Sky Sur- 
vey (SDSS, York et al. 2000). Results on quasar clustering 
have been reported using the 2QZ samples (e.g., Porciani, 
Magliocchetti & Norberg 2004; Croom et al. 2005; da An- 
gela et al. 2005; Porciani & Norberg 2006), the SDSS sam- 
ples (e.g., Myers et al. 2006, 2007a, 2007b; Shen et al. 2007a, 
2008a; Ross et al. 2008), and the 2dF-SDSS LRG and QSO 
survey (2SLAQ, Croom et al. 2008) sample (e.g., da Angela 
et al. 2008). Complementary to these results, quasar clus- 
tering has also been studied through cross-correlations with 
large galaxy samples (e.g., Adelberger & Steidel 2005; Coil 
et al. 2007; Padmanabhan et al. 2008), and surveys for close 
quasar pairs (e.g., Hennawi et al. 2006; Myers et al. 2008). 

While the statistics of quasar clustering are not yet compa- 
rable to those of local galaxy clustering, recent measurements 
of the quasar two-point correlation function (CF) already offer 
valuable information about the physical properties of quasars, 
such as their host dark matter halo masses and duty cycles. 
The amplitude of the quasar correlation function suggests that 
quasars live in massive dark matter halos, and therefore are bi- 
ased tracers of the underlying dark matter (e.g.. Kaiser 1984; 
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Bardeen et al. 1986); the bias increases with increasing red- 
shift (e.g., Croom et al. 2005; Myers et al. 2007a; Shen et 
al. 2007a; da Angela et al. 2008; Ross et al. 2008). By com- 
paring the relative abundance of quasars and host halos, one 
can infer the average duty cycles of quasar activity'" (e.g., 
Cole & Kaiser 1989; Martini & Weinberg 2001; Haiman & 
Hui 2001). With substantial uncertainties, the estimated host 
DM halo mass is 

2012-13 (g Porciani et al. 2004; 
Croom et al. 2005; Myers et al. 2006, 2007a; Shen et al. 
2007a; da Angela et al. 2008; Padmanabhan et al. 2008), 
and the estimated quasar lifetime is between a few Myr and 
10** yr. Quasar clustering measurements have also facilitated 
several theoretical investigations on the cosmic evolution of 
SMBHs within the hierarchical structure formation paradigm 
(e.g., Hopkins et al. 2007; Shankar et al. 2008a; White et 
al. 2008; Croton 2008; Shankar et al. 2008b; Wyithe & Loeb 
2008). 

Previous investigations also examined the potential lumi- 
nosity dependence of quasar clustering. If quasar luminosity 
is a good indicator of the host halo mass, then more lumi- 
nous quasars live in more massive halos, and hence should 
have stronger clustering. However, the current observations 
only show weak luminosity dependence of quasar clustering 
at z < 2.2 (e.g., Adelberger & Steidel 2005; Croom et al. 
2005; Porciani & Norberg 2006; Myers et al. 2007a; da An- 
gela et al. 2008), albeit with large error bars and, in most 
cases, limited dynamical range (< 1 dex in luminosity). If 
indeed quasar clustering depends only weakly on luminosity, 
then low luminosity quasars can also live in high mass halos 
and vice versa, which means there is substantial scatter in the 
host halo mass-quasar luminosity relation. This is naturally 
expected in some evolutionary quasar models, where quasar 
luminosity also depends on the evolving accretion rate onto 
the SMBH (e.g., Hopkins et al. 2005; Hopkins et al. 2006; 
Shen et al. 2007b). Based on numerical simulations of quasar 
light curves and prescriptions for the relation between quasar 
peak luminosity and host halo mass, Lidz et al. (2006) were 
able to show a weak luminosity dependence of quasar clus- 
tering at z 2 for intermediate luminosity ranges. They ar- 
gue that the peak luminosity is a better tracer of the host halo 
mass than the instantaneous luminosity, although in practice 
one can only observe the instantaneous luminosity. 

An alternative approach is to study quasar clustering as a 
function of black hole mass, which might better correlate with 
host halo mass than luminosity (e.g., Ferrarese 2002; Baes 
et al. 2003). Unfortunately for most quasars, the only esti- 
mates of their BH mass are virial BH masses (e.g., McLure & 
Jarvis 2002; Vestergaard & Peterson 2006; Shen et al. 2008c), 
which are based on broad line widths and continuum lumi- 

Luminous quasars are sparse enough such that most of the halos only 
host one quasar. Most of the multiple quasars in single halos are missed in 
the SDSS spectroscopic sample due to fiber collisions (see but the frac- 
tion of such companion quasars gives a negligible contribution to the quasar 
luminosity function as revealed in dedicated binary quasar surveys (e.g., Hen- 
nawi et al. 2006; Myers et al. 2008). Therefore no detailed halo occupation 
distribution modeling is needed to connect the halo abundance to the quasar 
luminosity function through the duty cycle. On the other hand, this simple 
halo model assumes that halo mass is the only parameter that determines the 
clustering properties of dark matter halos. Recently it has been suggested that 
halo clustering also depends on the assembly history and recent merger activ- 
ity (e.g., Gao et al. 2005; Wechsler et al. 2006; Furlanetto & Kamionkowski 
2006; Wetzel et al. 2007); such effects will comphcate the interpretation of 
quasar clustering measurements (e.g., Wyithe & Loeb 2008). Nevertheless 
we neglect these complications in the current work, since the magnitude of 
such effects is still quite uncertain at this moment. 



nosities measured from single-epoch spectra. Substantial un- 
certainties and biases exist for these estimates (e.g., Shen et 
al. 2008b), and any potential difference in clustering may 
be washed out by these complications (see the discussion in 
^4.2b . Nevertheless, we will use these virial BH mass es- 
timates to test any dependence of quasar clustering on BH 
mass, which in turn can offer some insight on the accuracy of 
these virial mass estimates. 

There are also sub-classes of quasars that show characteris- 
tic properties different from the "regular" quasar population. 
Quasars have a distribution of colors at a given redshift. Us- 
ing a sample of 4576 uniformly-selected quasars from SDSS, 
Richards et al. (2003) found that at each redshift the g-i color 
follows an approximate Gaussian distribution with a red tail, 
and they proposed to use the deviation from the median g - i 
color as a redshift-independent color indicator Thus quasars 
can be divided into blue, red and reddened populations (e.g., 
Richards et al. 2003), where the reddened population is most 
likely dust-reddened in the quasar host galaxy (e.g., Hopkins 
et al. 2004). There are also significant differences (at least 
in the ensemble average) between the emission line proper- 
ties for blue and red quasars (e.g., Richards et al. 2003). 
Moreover, a small fraction (< 10%) of quasars are radio-loud, 
although they have similar optical properties to radio-quiet 
quasars. It will be interesting to study their clustering prop- 
erties for these quasar sub-classes to see if their large-scale 
environment is responsible for their different properties. 

The purpose of this paper is to investigate various de- 
pendences of quasar clustering using a homogeneous spec- 
troscopic sample drawn from the SDSS Fifth Data Release 
(DR5, Adelman-McCarthy et al. 2007). In particular- we fo- 
cus on the dependences on luminosity, virial BH mass, quasar 
color and radio loudness. The redshift evolution of quasar 
clustering/bias factor and redshift distortions at z < 2.2 are re- 
ported in a companion paper (Ross et al. 2008). In ^we de- 
scribe our sample. We present clustering measurements in ^ 
and discuss our results in ^ We adopt a flat ACDM cosmol- 
ogy with cosmological parameters consistent with the three 
year WMAP observations (Spergel et al. 2007): Q.m = 0.26, 
f^A = 0.74, h = 0.7, ilb = 0.0435, n, = 0.95, erg = 0.78. Comov- 
ing distance will be used throughout unless otherwise speci- 
fied. 

2. DATA 

The SDSS uses a dedicated 2.5-m wide-field telescope 
(Gunn et al. 2006) with a drift-scan camera with 30 2048 x 
2048 CCDs (Gunn et al. 1998) to image the sky in five broad 
bands {ugriz; Fukugita et al. 1996). The imaging data are 
taken on dark photometric nights of good seeing (Hogg et 
al. 2001), are calibrated photometrically (Smith et al. 2002; 
Ivezic et al. 2004; Tucker et al. 2006) and astrometrically (Pier 
et al. 2003), and object parameters are measured (Lupton et 
al. 2001; Stoughton et al. 2002). Quasar candidates for spec- 
troscopy are selected from the imaging data using their col- 
ors (Richards et al. 2002), and are arranged in spectroscopic 
plates (Blanton et al. 2003) to be observed and confirmed with 
a pair of double spectrographs. 

The SDSS spectroscopy prohibits two targets closer than 
55" to be assigned spectroscopic fibers on one plate due to 
fiber collision. Although serendipitous close pairs can be ob- 
served on overlap plates (Hennawi et al. 2006), their com- 
pleteness is poor in the SDSS main quasar sample. Fig. 1 
in Ross et al. (2008) shows the minimal projected comoving 
separations corresponding to the 55" fiber collision scale as 
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function of redshift. These are the smallest scales that can be 
systematically probed by the SDSS spectroscopic quasar sam- 
ple at each redshift. Investigations on quasar clustering below 
these scales can only be achieved with other dedicated quasar 
samples (e.g., Hennawi et al. 2006; Myers et al. 2008). 

2.1. Sample Selection 

We take the DR5 spectroscopic quasar catalog (Schneider 
et al. 2007) as our parent sample from which we draw our 
clustering samples. This catalog contains 77,429 bona fide 
quasars that have luminosities larger than M,- = -22.0 (us- 
ing a slightly different cosmology in that paper) and have 
at least one broad emission line (FWHM > 1000 km s"') or 
have interesting/complex absorption features. About half of 
the quasars in this catalog were selected from a uniform al- 
gorithm (as described in Richards et al. 2002) implemented 
after DRl (Abazajian et al. 2003), which is flux limited to" 
/ = 19.1 at z < 3 and / = 20.2 at z > 3. There are a few 
(< 2%) ; > 19.1 quasars at z < 3 which were selected by the 
high-z igriz) branch of the targeting algorithm (Richards et al. 
2002); we reject these objects when constructing the homoge- 
nous clustering sample. Our final clustering sample thus in- 
cludes all uniformly-selected quasars with limiting magnitude 
/ =19.1 at z < 2.9 and / = 20.2 at z > 2.9. This homogeneous 
clustering sample includes 38,208 quasars covering the red- 
shift range 0.1 < z < 5.3; the angular coverage of this sample 
is identical to the high-z (z > 2.9) quasar sample studied in 
Shen et al. (2007a, the all sample), and covers a solid an- 
gle of ~ 4000 deg^. We do not consider further refinements 
of the sample, such as using only good imaging fields as in 
Richards et al. (2006) and Shen et al. (2007a), because at 
low z, quasars are found by UV excess, and are not as subject 
to subtle problems in the photometry as are high-z quasars, 
and we generally found consistent results with and without 
including bad fields (e.g., see appendix C in Ross et al. 2008). 
Although we may gain better control over systematics, such 
refinements will reduce the number of quasars in each sub- 
sample and therefore decrease the S/N for clustering measure- 
ments. The entire clustering sample is shown in the left panel 
of Fig. [T]in the space of redshift and /-band absolute mag- 
nitude (/T-corrected to z = 2, see below and Richards et al. 
2006). This homogeneous quasar sample is also used in the 
companion paper (Ross et al. 2008), where it is referred to as 
the UNIFORM sample. 

We divide our sample into six redshift bins: [0.1,0.8], 
[0.8, 1.4], [1.4,2.0], [2.0,2.5], [2.9,3.5], and [3.5,5.0], where 
we have deliberately avoided straddling two redshift deserts 
of quasars at z 2.7 and z ^ 3.5, caused by the inefficiency 
of quasar color selection at these redshifts due to the con- 
tamination from F and GK stars (Fan 1999; Richards et al. 
2002; see also fig. 1 in Richards et al. 2006). Note that 
these redshift bins are different from those used in Ross et al. 
(2008). While the redshift evolution of quasar clustering is 
not the focus of this paper, we measure quasar clustering for 
these redshift bins independently from Ross et al. (2008) as 
a consistency check. Within each redshift bin, we further di- 
vide the sample according to luminosity or virial black hole 
mass, where we take the virial mass estimates from Shen et 
al. (2008b). These redshift grids are shown in Fig. [T|for L-z 
(left) andMBH.vii -z (right). We use the A'-corrected absolute 
/-band magnitude M,(z = 2) normalized at z = 2 (e.g., Richards 

' ' All magnitudes in this paper are corrected for Galactic extinction using 
the Schlegel, Finkbeiner & Davis (1998) map. 



et al. 2006) as a luminosity indicator. The conversion between 
this non-standard absolute magnitude and M,(z = 0) is given 
by equation (1) in Richards et al. (2006). For a universal 
power-law spectral energy distribution (SED) with a^, = -0.5 
we simply have M,(z = 0) = M,(z = 2) + 0.5964. There is also 
a tight correlation between M,(z = 2) and the bolometric lumi- 
nosity Lboi estimated from the quasar spectrum, as shown in 
fig. 2a of Shen et al. (2008b). This correlation is described 
byi2: 

M,(z = 2) = 90-2.51og(Lboi/ergs-i) , (1) 

where the Gaussian scatter of the correlation is cr = 0.166. 

While the L-z and Mbh.vIi grids in Fig. [T|can be used 
to study the luminosity dependence and virial mass depen- 
dence at fixed redshift, the small number of objects in each 
bin will limit the quality of clustering measurements. We thus 
construct four other divisions using a larger redshift range 
0.4 < z < 2.5 to increase the signal-to-noise ratio, which are 
the following: 

(a) median luminosity (bright and faint samples). The sam- 
ple is divided by the median luminosity at each redshift, which 
is used to test luminosity dependence. In addition we also 
divide the sample into the top 10% luminous quasars and 
the remaining less luminous quasars, since if these bright- 
est quasars live in the rarest halos then their clustering will 
be much stronger than the less luminous ones because of the 
rapid increase of linear bias factor with halo mass (e.g.. Kaiser 
1984; Mo & White 1996). 

(b) median virial mass (high and low mass samples). The 
sample is divided by the median virial BH mass at each red- 
shift, which is used to test virial BH mass dependence. As 
in the luminosity case, we also divide the sample into the top 
10% massive quasars and the remainder 

(c) color division (reddened, red and blue samples). The 
sample is divided according to relative quasar colors. We use 
A(g- /), i.e., the deviation from the median g-i color at each 
redshift for our sample'^, as a color indicator (described in 
detail in Richards et al. 2003). The reddened quasars are clas- 
sified as those with A(g-/) > 1.5(t from the median value 
A(g - /) = 0, where a is the standard deviation of the dis- 
tribution of A{g-i)\ the red quasars are classified as those 
with < A(g-/) < 1.5(7; the blue quasars are those with 
A(g- /) < 0. These subsamples are used to test color depen- 
dence. 

(d) radio division ( radio and non-radio samples). The sam- 
ple is divided into FIRST-detected and undetected sources 
using the target selection information included in the DR5 
quasar catalog. The FIRST-detected and undetected quasars 
within this redshift range 0.4 < z < 2.5 are complete to / = 
19. 1 in our uniform sample. The FIRST-detected sample con- 
tains quasars that are generally radio-loud with typical radio- 
loudness TZ = f(, cm//2500 ^ 100 (Jiang et al. 2007). These 
subsamples are used to test radio dependence. To simplify 
the discussion we use FIRST-detected/undetected and radio- 
loud/quiet interchangeably in the following sections, but we 
acknowledge that they are two different concepts. Note that 
the flux limit of the FIRST survey is bright enough that mod- 
erately radio-loud objects {R ^ 10) may be undetected. 

A fixed slope of b = -2.5 is used in fitting Mi(7. = 2) = a + 
/)log(Lboi/ergs"'). If we do not fix b then the best fitted parameters are 
<3 = 94 ± 11 , i) = — 2.58 it 0.23, which is fully consistent with a nominal slope 
b = -2.5 between luminosity and magnitude. 

Note these A(g — i) values are re-calculated for our uniformly-selected 
sample, not those tabulated in the Schneider et al. (2007) catalog. 
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Since all the four divisions include quasars from a wide red- 
shift range, it is crucial to ensure that the redshift distributions 
are similar for whichever subsamples are compared. The bot- 
tom panels in Figs. |2}21show that our choice of subdividing 
these samples yields almost identical redshift distributions for 
comparing subsamples. The disadvantage of these divisions is 
that they intermix quasars of different luminosity, mass etc., at 
various redshifts. However, if the dependences of clustering 
on these properties are monotonic and do not evolve signifi- 
cantly over the redshift range 0.4 < z < 2.5, we still expect to 
see differences in the averaged sense. 

2.2. The Two-Point Correlation Function 

The simplest way to measure clustering is the redshift space 
correlation function where s is the redshift space separa- 
tion of quasar pairs. To minimize the effects of redshift dis- 
tortions and redshift errors, the projected correlation function 
Wp{rp) is often used (e.g., Davis & Peebles 1983), where rp is 
the projected separation. In our clustering studies, both ^,(5) 
and Wpifp) will be presented. To compute ^^(.s) and Wpijp), we 
use the Landy-Szalay estimator (Landy & Szalay 1993). For 
error estimation we use jackknife resampling as described in 
Shen et al. (2007a). Only the diagonal elements in the covari- 
ance matrix are used in the fitting because we found the co- 
variance matrix is generally too noisy; on the other hand, we 
find consistent results with or without including off-diagonal 
elements in the fitting (Shen et al. 2007a; Appendix B of 
Ross et al. 2008). We refer to our previous paper (Shen et al. 
2007a) and Ross et al. (2008) for more detailed descriptions 
on the estimators used, error estimations and generating ran- 
dom catalogs. In some cases, the sample is too sparse to make 
useful auto-correlation measurements, so we cross-correlate it 
with a larger quasar sample matched in redshift to boost the 
clustering signal (e.g.. Coil et al. 2007; Shen et al. 2008a; 
Padmanabhan et al. 2008). 

Although our sample represents the largest spectroscopic 
quasar sample to date, we are limited by shot noise, because 
our sample is much more sparse than low redshift galaxy sam- 
ples (e.g., Zehavi et al. 2005; Padmanabhan et al. 2007), 
type 2 AGN samples (e.g., Constantin & Vogeley 2006; Li 
et al. 2006), or photometric quasar samples (e.g., Myers et 
al. 2006, 2007a, 2007b). The volume number density of the 
SDSS main quasar sample is even lower than the number den- 
sities of the 2QZ (Croom et al. 2004) and the 2SLAQ (Croom 
et al. 2008) spectroscopic samples studied by several authors 
(e.g., Porciani et al. 2004; Croom et al. 2005; da Angela et 
al. 2005; da Angela et al. 2008), because SDSS does not 
reach as far down the quasar luminosity function as the other 
surveys. The sparseness of the sample directly impacts our 
strategy of measuring the correlation function and estimating 
its error In particular, only a handful of quasar pairs will be 
present for the smallest projected rp bins when we compute 
Wp, which not only makes the results noisy but also puts the 
usage of Wp into question. Binning effects may also become 
important for these sparse-pair bins. For this reason, we de- 
velop a Maximum-Likelihood method in ^3.21 which we use 
to fit the smallest scales. 

In addition to statistical issues with our sample, there might 
also be some systematics that are difficult to quantify. As al- 
ready mentioned earlier, near redshifts z ^ 2.1 and z 3.5 
quasars are close to the stellar locus in color-color space, 
and the selection efficiency drops rapidly. The number of 
observed quasars near these redshifts is reduced and pairs 



with similar colors are preferentially lost, hence the corre- 
lation function measurements suffer. Both statistical fluctu- 
ations and systematic issues can lead to negative correlations 
at certain scales. Including or excluding these negative bins 
in the power-law model fitting sometimes yield quite differ- 
ent results. In addition, the jackknife errors we are using may 
underestimate the real errors for small number statistics. For 
all these reasons, we caution that the actual uncertainties in 
our clustering measurements may be larger than our formal 
estimates. 

3. CLUSTER™G MEASUREMENTS 
3.1. Binned Correlation Functions 

In this section we use the traditional binned correlation 
function and power-law models to measure the clustering 
properties. We use an alternative un-binned, maximum- 
likelihood approach in ^3.21 for small to intermediate scales 
probed in our sample. 

Before we continue, we want to clarify several issues with 
fitting a single power-law model of ^j(.s) = (s/sq)"^' or f(r) = 
('"/'"o)"^ to the raw CF data points. First of all, the underlying 
CF is not a perfect power-law at all scales; even if it were, 
because of the statistical fluctuations of CF data for sparse 
samples, the best-fit parameters of power-law models would 
depend on the fitting range. Hence the best-fit parameters 
should be quoted with fitting ranges. Secondly, for most of 
our subsamples the data are of insufficient quality for a flex- 
ible power-law-index fit, hence we have fixed the power-law 
indices to be 7, = 1.8 and 7 = 2, close to the observed values 
(e.g., Porciani et al. 2004; Croom et al. 2005; Shen et al. 
2007a). Thirdly, occasionally some CF data points are neg- 
ative due to statistical fluctuations or unknown systematics; 
including those negative points are necessary, but, because of 
these outliers and the nature of power-law models, the ^ fit 
will have high model rejection probability. Hence we perform 
fits both with and without negative CF data points within the 
fitting range. For most of the cases the resulting correlation 
lengths are consistent within Icr errors, but there are excep- 
tions. Given the possibility that some of the negative points 
might be caused by systematic effects (i.e., close to the red- 
shift deserts or at characteristic scales of preferentially los- 
ing pairs), we believe the actual clustering strength should lie 
somewhere in between these two limits. 

Quasars are biased tracers of the underlying dark matter. 
The linear scale-independent bias factor is normally defined 
as = ^Qso/^ji where ^m(z) is the matter correlation func- 
tion, extrapolated to high redshift according to the linear 
growth model. We compute using a linear CDM power 
spectrum with transfer function from Eisenstein & Hu (1999), 
and estimate b using the integrated correlation function to 
marginalize nonlinear effects and scale-dependent bias'"* 

ii^= ^ f '^'\{r)r'dr , (2) 

where we choose r^^ = 5 /!~'Mpc and r^ax = 20 /i"'Mpc (note 
these choices are slightly different from Ross et al. 2008). 
We use the best-fit power-law models to compute the inte- 
grated ^20 for quasars, and estimate the matter correlation 
function at the median redshift of the quasar sample. For 

We find that the bias calculated in this way is only slightly larger than 
the one estimated at r = 20 /i^'Mpc by ~ 5%, below the uncertainty from the 
clustering measurement itself (> 10%; see Table 1). 
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cross-correlation functions with power-law models and fixed 
power-law index 7 = 2, we simply have 

'"0,2 = '•o.cross/'"o,i , bi = bl^^Jbi , (3) 

where subscript "cross" refers to the cross-correlation and 1, 
2 refer to auto-correlations of sample 1 and 2. Note that with 
equation (|3]l we have implicitly assumed that both samples 1 
and 2 are perfectly correlated with each other, i.e., they both 
trace the underlying DM distribution exactly. 

3.1.1. Redshift Evolution 

In the companion paper (Ross et al. 2008) we study the 
redshift evolution of quasar clustering in detail. Here we only 
provide brief discussion on this topic. 

There have been several studies of the redshift evolution 
of quasar clustering, based on large survey samples such as 
2QZ, SDSS, and 2SLAQ (e.g., Croom et al. 2005; Myers et 
al. 2006, 2007a; Porciani & Norberg 2006; Shen et al. 2007a; 
da Angela et al. 2008; Ross et al. 2008). While the redshift 
evolution at z < 2.2 is still controversial to some extent (cf. 
Porciani & Norberg 2006; da Angela et al. 2008; Ross et al. 
2008) owing to the uncertainties in the clustering measure- 
ments for individual redshift bins, high redshift (z > 3) SDSS 
quasars have much stronger clustering than their low redshift 
(z < 2.2) counterparts (Shen et al. 2007a), reflecting the large 
bias of their host halos. 

We have performed independent clustering measurements 
using our redshift bins indicated in the left panel of Fig. [T] 
but with all the quasars in each redshift bin, i.e., no luminos- 
ity cut. The best-fit values of correlation length and linear 
bias are tabulated in Table [T] Our results are in good agree- 
ment with those of Ross et al. (2008). We see a slight trend 
of increasing clustering with redshift at z < 2.5, but the uncer- 
tainties are large in each redshift bin. 

3.1.2. Luminosity Dependence 

At each redshift our sample spans a range in luminosity 
(typically 3 magnitudes). Hence the next question is whether 
or not quasar clustering depends on luminosity. On theoret- 
ical grounds, if quasar luminosity perfectly correlates with 
host halo mass, we expect that higher-luminosity quasars live 
in more massive halos and therefore are more strongly clus- 
tered. Weak dependence of quasar clustering on luminosity 
would imply that a quasar's luminosity is not a good tracer of 
its halo mass - there is substantial scatter in L-M^h and/or 
MBH-A^haio correlations (e.g., Lidz et al. 2006). Observa- 
tionally an important caveat is that the errors on the clus- 
tering measurements must be small enough that any differ- 
ence caused by varying host halo mass is discernible. No 
strong luminosity dependence has yet been reported (e.g., 
Adelberger & Steidel 2005; Porciani & Norberg 2006; Myers 
et al. 2007a; da Angela et al. 2008); but most of the studies 
are for z < 2.2 and the error bars are large. 

We attack this problem by first examining the 0.4 < z < 2.5 
samples divided by the median luminosity at each redshift 
02. II and the top-left panel of Fig. |2]i, which offers the best 
signal-to-noise ratio, although it mixes objects at different 
redshifts and different luminosities. The correlation function 
for the bright and faint quasar samples are shown in the upper 
two panels in Fig. |5] The projected correlation functions are 
consistent with one another given the errors. 

We then compare the results for the 10% most luminous 
quasars and the remainder, as shown in the bottom panels in 



Fig. |5] where we use cross-correlation for the 10% most lumi- 
nous quasars with the remaining 90% less luminous quasars. 
We detect stronger clustering strength for the most luminous 
quasars at the ^ 2a level (e.g., see Table [T]i. This is some- 
what expected since the halo bias factor increases rapidly with 
mass. If these brightest quasars live in the rarest and most 
massive halos, their clustering will be appreciably stronger 
than the fainter quasars, an effect that is detectable even with 
our current sample size. 

Finally we compare the clustering in each luminosity- 
redshiftbin as indicated in the left panel of Fig. [T] since the lu- 
minosity dependence of clustering may evolve with redshift. 
The results are shown in Fig. |6] Given the large uncertain- 
ties, no appreciable luminosity dependence is observed in any 
of the redshift bins. The comparison of different luminosity 
samples becomes increasingly difficult at high redshift, and 
hence we defer the discussion of the luminosity dependence 
of clustering at z > 2.9 to ^3.21 

For the overall quasar population, one can also com- 
pare the clustering amplitudes between our sample and the 
2QZ/2SLAQ samples (e.g, Porciani et al. 2004; Croom et al. 
2005; Porciani & Norberg 2006; da Angela et al. 2008) or the 
SDSS photometric quasar samples (e.g., Myers et al. 2006, 
2007a). For the same redshift ranges at z ^ 2.2, all samples 
have comparable clustering amplitude within the error bars. 
The average luminosity is the highest for our SDSS spectro- 
scopic quasar sample and the lowest for the 2SLAQ sample 
(differing by a factor of ~ 5), which also indicates weak lu- 
minosity dependence of quasar clustering at z < 2.2 for all but 
the highest luminosities. We discuss the implications of this 
weak luminosity dependence at z < 2.5 in ^ 

3.1.3. Virial Mass Dependence 

We have divided the sample in virial BH masses using the 
measurements from Shen et al. (2008b). Quasars have a range 
of Eddington ratios at fixed BH mass (e.g., Kollmeier et al. 
2006; Shen et al. 2008b), thus the instantaneous quasar lu- 
minosity is not a good tracer of the BH mass. If BH mass 
is a superior tracer of its host halo mass, we might expect a 
BH mass dependence of quasar clustering. We test this pos- 
sibility using virial BH masses measured from single-epoch 
quasar spectra. However, we note that virial BH masses are 
not true BH masses (see the extensive discussion in Shen et 
al. 2008b). The scatter between virial and true BH masses 
may also dilute any potential clustering dependence, which 
we will return to in ^4.21 

As in the luminosity dependence case, we show the compar- 
ison of our 0.4 < z < 2.5 high mass and low mass samples in 
Fig. I2] While there is some indication that the high mass sam- 
ple has a stronger clustering amplitude in redshift space, it is 
completely consistent with no difference in the projected cor- 
relation function (see Table[T]for the best-fit ro values). How- 
ever, when the sample is divided into the top 10% massive 
quasars and the remainder, the clustering is stronger for the 
quasars with highest virial BH masses, as shown in the bot- 
tom panels of Fig. |2l The difference in the clustering strength 
for the cross-correlation of the 10% most massive quasars and 
the auto-correlation of the remaining 90% is significant at the 
^ 2(7 level (see Table[T]). 

Similarly we show the comparison between virial mass bins 
at each redshift bin (as indicated in the right panel of Fig. [U 
in Fig. |8] Again no appreciable mass dependence is observed, 
although the S/N is poor in most bins. 
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TABLE 1 

Measurements of the two-point correlation function 



Sample 


A'qso 


z 




[•«l,-?2] 


*0 7,,=1.8 


{rpA,rp,2\ 


ro\"/=2 


h 


ML [Ki,fi2] 


'-O.Ml|7„l=2 






















('/i^'Mnr'l 


All luminosities 






















0.1 <z<0.8 


7902 


0.50 


45.57 


[2,40] 


5.07 ±0.40 


[3,40] 


6.14±0.78 


1.32±0.17 


[2,15] 
[5,20] 


6 94+0-62 

7 7';+0.78 


0.8 <z< 1.4 


9975 


1.13 


46.27 


[2,150] 


6.21 ±0.59 


[3,80] 


8.16±0.79 


2.31 ±0.22 


[2, 15] 


65+^:§? 

''•-'^-1.14 


1.4<z<2.0 


11304 


1.68 


46.60 


[2,150] 


(6.01 ±0.60) 
7.35 ±0.55 


[3,80] 


(7.76 ±0.80) 
8.51 ±0.76 


2.20 ±0.23 
2.96 ±0.26 


[5,20] 
[2,15] 


y g2+0 M 
='•-"-0.98 












(6.94 ±0.56) 




(7.16±0.77) 


2.49 ±0.27 


[5,20] 


2.0<z<2.5 


3828 


2.18 


46.86 


[2, 150] 


8.36±1.38 
(7.40 ±1.42) 


[3,80] 


11.47±1.71 
(8.04 ±1.94) 


4.69 ±0.70 
3.29 ±0.79 


[2,15] 
[5,20] 


6.05+^-;^^ 
5 41+2-^1 


2.9<z<3.5 


2693 


3.17 


46.73 


[2, 150] 


10.36 ±1.82 
(8.82 ±1.86) 


[16,150] 


14.58 ±2.70 
(13.34±2.78) 


7.76 ±1.44 
7.10±1.48 


[2,15] 
[5,20] 


"•"-'-0.03 


3.5<z<5.0 


1788 


3.84 


47.07 


[16,150] 


22.10±3.44 
(17.05±3.61) 


[16,150] 


21. 04 ±3.39 
(16.00 ±3.68) 


12.96 ±2.09 
9.85 ±2.27 


[20,40] 
[2,15] 
[5,20] 


1 9 60+3-5'' 
17.54+3.58 

21 04+^'^^ 



2.9<z<3.5 

-27.3 <M, < -25.7 1262 3.17 46.67 [2,15] 0.00!|-33 

[5,20] Q-02^li 

[20,40] 21.92^-* 

-29.7 < M, < -27.3 1416 3.17 46.91 [2,15] ll-04!^29 

[5,20] 13.31!=:^;; 

[20,40] 12.60+'-*!? 



0.4<z<2.5 
bright 


15291 


1.40 


faint 


15228 


1.40 


10% most luminous-cross 


3050 


1.40 


90% least luminous 
10% most luminous-auto 


27469 
3050 


1.40 
1.40 


high-mass 


13414 


1.35 


low-mass 


13443 


1.35 


10% most massive-cross 


2698 


1.35 


90% least massive 
10% most massive-auto 
reddened-cross 


24159 
2698 
2505 


1.35 
1.35 
1.40 


red 


12697 


1.40 


blue 

FlRST-cross 


15317 
2173 


1.40 
1.30 


non-FIRST 


28346 


1.40 


FlRST-auto 


2173 


1.30 



46.56 


[2,115] 


7.71 ±0.67 






(6.38 ±0.66) 


46.31 


[2,115] 


8.11 ±0.64 






(7.60 ±0.64) 


46.84 


[2,115] 


7.78 ±0.82 


46.39 


[2,115] 


6.39 ±0.37 


46.84 






46.51 


[2,115] 


8.25 ±0.55 






(7.36 ±0.56) 


46.37 


[2,115] 


5.70±0.68 






(5.56 ±0.68) 


46.64 


[2,115] 


8.62 ±0.87 






(8.23 ±0.88) 


46.42 


[2,115] 


6.53 ±0.44 


46.64 






46.31 


[2,115] 


6.08 ±0.94 






(5.31 ±0.98) 


46.41 


[2,115] 


7.71 ±0.80 


46.43 


[2,115] 


7.68 ±0.60 


46.38 


[2,115] 


8.91 ±0.87 






(8.68 ±0.88) 


46.42 


[2,115] 


6.91 ±0.36 






(6.64 ±0.36) 


46.38 







[3,115] 


7.19± 1.06 


2.26 ±0.33 




(6.42 ± 1.10) 


2.02 ±0.35 


[3,115] 


6.86 ±0.75 


2.16±0.24 


[3,115] 


9.53 ±0.80 


3.00 ±0.25 




(8.72 ±0.82) 


2.75 ±0.26 


[3,115] 


7.06 ±0.46 


2.22 ±0.14 




12.86±2.32 


4.05 ±0.73 




(10.77 ±2.14) 


3.39 ±0.67 


[3,115] 


8.44 ±0.82 


2.61 ±0.25 




(7.78 ±0.85) 


2.40 ±0.26 


[3,115] 


8.00 ± 1.00 


2.47 ±0.31 




(7.29± 1.01) 


2.25 ±0.31 


[3,115] 


8.73 ±0.88 


2.70 ±0.27 


[3,115] 


7.10±0.57 


2.19±0.18 




10.73 ±2.33 


3.32 ±0.72 


[3,115] 


8.10± 1.20 


2.55 ±0.38 




(6. 10 ± 1.36) 


1.92 ±0.43 


[3,115] 


9.17±0.93 


2.89 ±0.29 




(8.42 ±0.95) 


2.65 ±0.30 


[3.115] 


7.81 ±0.71 


2.46 ±0.22 


[3,115] 


9.61 ±0.85 


2.91 ±0.26 




(9.18±0.86) 


2.78 ±0.26 


[3,115] 


7.12±0.50 


2.24 ±0.16 




12.97 ±2.47 


3.93 ±0.75 




(11.84±2.37) 


3.59 ±0.72 



Note. — Tabulated values of the fitted correlation functions for various samples. All length scales are in units of h Mpc. The fitting ranges are shown in 
"[ ]", and in "( )" we show the fitting results including negative data points (see the discussion in the text). The Icr en^or 5h in h is just Si, = h(5ro / ro) for the fixed 
power-law 7 = 2 models and neglecting the theoretical uncertainty in the matter correlation function 5;„(r,z). 



3. 1 .4. Color Dependence 

Quasars have different colors, either intrinsic or dust red- 
dened. As discussed in ^2.11 we have followed the approach 
of Richards et al. (2003) to divide our quasar sample by me- 
dian g-i color excess A(g-i) for 0.4 < z < 2.5, which re- 
moves the mean color at each redshift [e.g., division (c) in 
^ see Fig. [3). 

The red and blue quasar samples are sufficiently large to 
perform auto-correlations, and we show the results in the up- 
per two panels of Fig. |9] The red and blue quasars are con- 



sistent with having similar clustering properties. The red- 
dened quasar sample is too sparse to do auto-correlation, so 
we cross-correlate the blue+red quasar sample with the red- 
dened quasar sample, and show the results in the bottom two 
panels of Fig. |9] The reddened quasars have similar clus- 
tering amplitude. Thus our data indicate there is weak or no 
dependence of quasar clustering on colors. 

3.1.5. Radio Dependence 
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Finally we investigate if there is any dependence of quasar 
clustering on radio activity. Using our 0.4 < z < 2.5 FIRST- 
detected and undetected samples as indicated in Fig. H] 
we show their clustering comparison in Fig. [TOl where we 
use auto-correlation for the FIRST-undetected quasars and 
cross-correlate them with the FIRST-detected quasars. The 
FIRST-detected quasars are appreciably more strongly clus- 
tered (cross correlation length ro = 9.61 ±0.85 /!~'Mpc) than 
the FIRST-undetected quasai's (ro = 7.12 ±0.50 h'^Mpc), 
which indicates that radio quasars live in more massive dark 
matter halos. These FIRST-detected quasars also tend to have 
systematically larger virial BH masses than those FIRST- 
undetected quasars by 0.12 dex (e.g., Shen et al. 2008b). 
However, the difference in their clustering remains when we 
consider a radio-undetected quasar sample with virial masses 
matched to those of the radio sample. Thus more massive host 
halos, and denser environments may be related to the trigger- 
ing of radio activity. 

3.2. A Maximum-Likelihood Approach 

At the smallest separations, there is only a handful of pairs 
distributed among the bins, hence binning effects may become 
important, and the usage of the projected correlation function 
Wp(rp) becomes questionable. Here we design a Maximum- 
Likelihood (ML) method to use in this situation. Poisson 
statistics are assumed to apply, hence we only focus on the 
smallest bins where the pairs are statistically independent, i.e., 
the total number of pairs within each bin is much less than the 
total number of quasars in the sample. 

We take a power-law model for the underlying real space 
correlation function ^(r) = (r/ro.ML)"^"''- Then we compute 
the expected numbers of quasar-quasar and random-quasar 
pairs for a specified quasar sample, within a comoving cylin- 
drical volume with projected radius R to R + dR and half- 
height AH = 30 /!"'Mpc; the usage of a cylindrical volume 
is to minimize the effects of redshift distortions and errors, 
and we found our results are insensitive to the value of AH. 
In mathematical form, the expected number of quasar pairs 
within the cylindrical annulus volume is: 

/AH 
[ 1 + ^(r)]dH xNq = 2iTRh{R)dR , (4) 

where r = {R^+H^y^^, Nq is the number of quasars included 
in the sample under consideration, and 2TiRg{R)dR is the 
number of random-quasar pairs within interval dR divided by 
the height 2AH, averaged over Nq quasars. To compute'^ 
g(R) we have used the number density of quasars of the sam- 
ple under study as a function of redshift and the polygons 
which define the angular geometry of our sample (see ap- 
pendix B of Shen et al. 2007a; and Hamilton & Tegmark 
2004 for the description of spherical polygons). 

Following Croft et al. (1997, also see Stephens et al. 1997) 
we write the likelihood function as 

N 

where /x,- = IwRihiRddR, the expected number of pairs in the 
interval dR, and the index j runs over all the elements dR 
in which there are no pairs. Defining the usual quantity S = 

Note that in computing g(R) we have divided by a factor of 2 because 
we are calculating the number of pairs, not the number of companions. 



-21n£ we have 

5 EE -2 In £ = 2 / 2TTRh(R)dR - 2 V ln[h(Ri)] , (6) 

Jr... 

with all the model independent terms removed. Here the sum- 
mation is over all quasar pairs we found in our sample, 
and [Rmm,Rmix] is the range of scales over which we search 
for quasar pairs. The values of R,^in and Rmax are chosen to 
avoid fiber collision effects and to ensure that the pairs found 
are statistically independent, i.e., R^i„ > 2 /i"'Mpc and gener- 
ally /?max ^ 20 /!"'Mpc. But we find that different ranges of 
[RmimRmsLx] do yield different results, which do not always 
overlap within la. Hence we examine different ranges of 
[7?,nin,7?n,ax] for cach Sample. We then minimize S with re- 
spect to ro ML and obtain its la uncertainty. Again, because of 
the noisy data we fix 7ml = 2 in our fitting procedure. 

Compared with binned CFs, the ML procedure has several 
drawbacks. The information from pairs with large separations 
(R > ^max) is not used, and the merits of the ML method suffer 
if there is a paucity of pairs due to systematic effects (see 
below). For these reasons, we only apply this ML method and 
compare the ML estimates with our binned CF results as a 
consistency check (see Table[T|. 

We start with the two luminosity bins at 2.9 < z < 3.5 
shown in the left panel of Fig. [T] In Fig. [TT]we show the 
binned CFs for these two luminosity bins. On small scales 
(rp < 15 /i~'Mpc), an apparent deficit of correlation is seen 
for the fainter bin with respect to the brighter bin. On larger 
scales, the signal-to-noise ratios of the CFs are too small to 
distinguish any difference in clustering strength. For the ML 
approach we choose a comoving volume of a cylinder an- 
nulus with half-height AH = 30 /i"'Mpc and [/?min,^max] = 
[2, 15] /r'Mpc, [5,20] /i-'Mpc, and [20,40] /i-'Mpc. The best 
fit ML correlation lengths are listed in TablelT] which show no 
detectable correlations at small scales (r^ < 15 /r'Mpc) for 
the fainter bin, consistent with the binned CF results. 

The relative paucity of small-separation pairs in the fainter 
2.9 < z < 3.5 luminosity bin is much stronger than seen in the 
z < 2.9 luminosity bins. However, we suspect this is at least 
partly caused by some not well-understood systematic effects 
in our sample. The lower redshift cut z = 2.9 is close to the 
redshift desert at z ~ 2.7 where the quasar selection efficiency 
drops. The completeness becomes worse close to the magni- 
tude cut / = 20.2 at z ~ 2.9, as seen in our sample (also see 
fig. 17 of Richards et al. 2006). As we argued above, image 
quality becomes important in selecting quasars close to the 
faint magnitude cut, and preferential loss of quasar pairs with 
similar colors is possible. Therefore we do not claim that the 
apparent weaker clustering of fainter quasars at 2.9 < z < 3.5 
and at scales r < 20 /!"'Mpc is significant. On the other hand, 
our sample is unable to give clear evidence of a dependence of 
clustering on luminosity at large scales at z > 2.9 because of 
the sample sparseness. We hope that ongoing and future sur- 
veys of high-redshift quasars at the faint luminosity end can 
help resolve these issues. 

We now apply the ML method to several subsamples in dif- 
ferent redshift bins, and compare the results with those us- 
ing binned method in Table [T] The ML method yields results 
consistent with the binned method within error bars. In fact, 
the difference in the best-fit values of ro.ML for various fitting 
ranges in the ML approach is fully reflected in the binned CF 
data points, i.e., we get small values of ro,ML for fitting ranges 
where the binned CF data points fall below the power-law fit. 
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4. DISCUSSION 

We now discuss the dependences (or lack thereof) of quasar 
clustering on various quantities that we have found. Our clus- 
tering measurements of various subsamples are summarized 
in Fig. [T2]in terms of the bias factor. The goal is to compare 
the observed clustering measurements with theoretical predic- 
tions; or alternatively, to use these measurements to constrain 
theoretical models, i.e., the form and scatter of various re- 
lations, such as relations of true BH mass versus host halo 
mass, luminosity versus true BH mass/host halo mass, and 
virial BH mass versus true BH mass, etc. We assume that halo 
mass is the only relevant quantity that correlates with cluster- 
ing strength. Hence the question in any theoretical model is 
how to populate halos with quasars of various physical prop- 
erties (luminosity, BH mass etc), given constraints from the 
halo abundance, the quasar luminosity function and quasar 
clustering. 

4. 1 . Luminosity versus Host Halo Mass 

In this subsection we outline the basic logic in modeling the 
luminosity dependence of quasar clustering. Suppose that at 
fixed host halo mass, the quasar luminosity follows a correla- 
tion given by: 



log L = C + a logMhaio + ctl 



(7) 



where C and a are constants, and (Tl is a Gaussian random 
deviate. For simplicity we have assumed here a power-law 
form with log-normal scatter, but more complex correlations 
are possible. This correlation ^ gives the probability distri- 
bution of V{L\Mh^io) as. 



P(L|Mhaio) = (27ra2)-i/2exp 



[logL-(C+ a logMhaio)]^ 
2al 



(8) 

Using Bayes's theorem we can derive the probability distribu- 
tion of host halo mass at fixed luminosity V(Miin\o\L), given 
our knowledge of the halo mass function n(Mhaio) and the 
halo duty cycle the probability of a halo hosting an ac- 
tive quasar: 

^(Mhalo \L) CX 7'(L|Mhalo)n(Mhalo)/o„(Mhalo) , (9) 

where the halo mass function n(Mhaio) is usually extracted 
from simulations. Generally there will be a Malmquist-type 
bias because of the scatter ai and the bottom-heavy halo mass 
function. But the exact magnitude of this bias as function of 
luminosity also depends on the halo duty cycle /on, which it- 
self may well be a function of halo mass and redshift, and is 
constrained via the quasar luminosity function: 

^ = J '^Mhalo7'(L|Mhalo)n(Mhalo)/o„(Mhalo) • (10) 

Therefore given the assumed correlation Eqn. Q, halo mass 
function, and /on constrained from the quasar luminosity 
function, we obtain the mass distribution of halos hosting 
quasars shining at any fixed luminosity L via eqn. 

We also must know the linear halo bias factor Zjhaio as func- 
tion of halo mass in order to make clustering predictions. This 
is somewhat problematic, as noted in White et al. (2008). Var- 
ious fitting functions for the halo bias exist in the literature, 
which differ in both magnitude and shape (e.g.. Mo & White 
1996; Jing 1998; Sheth & Tormen 1999; Sheth et al. 2001; 
Tinker et al. 2005; Basilakos et al. 2008). In practice one 
needs a halo bias formula that is tested against large numeri- 
cal simulation sets at the ranges of halo mass and redshift of 



interest (e.g., see the discussion in Shankar et al. 2008b). Fi- 
nally the effective bias factor of quasars at fixed luminosity is 
the average over all the halos that host them: 



(11) 



bdL)= / £/Mhalo^halo(Mhalo):P(Mhalo|i) 



which is to be compared with observations. 

The correlation in eqn. (|7| is analogous to a light bulb 
model in which a quasar shines at a more or less fixed lu- 
minosity (i.e., a fraction of the Eddington luminosity of its 
SMBH) when it is on, with a log-normal scatter ai in lu- 

5/3 

minosity. Theoretical models predict a scaling L cx M^i^^^ or 

L cx Mjj^jj^ depending on energy conservation (e.g.. Silk & 
Rees 1998; Wyithe & Loeb 2002) or momentum conservation 
(e.g.. King 2003) during self-regulated black hole growth. Re- 
cent measurements of the Mbh-o" relation (e.g., Tremaine et 

4/3 

al. 2002) favor the latter scaling, L cx M^i^^^, as revealed in 
some hydrodynamic simulations (e.g., Di Matteo et al. 2005). 
A more sophisticated, and perhaps more physically motivated, 
estimate for 7'(L|Mhaio) has been put forward by P. Hopkins 
and collaborators, based upon simulated quasar light curves 
and correlations between quasar peak luminosity and halo 
mass (e.g., Hopkins et al. 2005; Lidz et al. 2006). Simple 
phenomenological models, however, such as the one that uses 
eqn. (|7]i, may fit the observations equally well (e.g., Shankar 
et al. 2008a; Croton 2008; Shankar et al. 2008b). A full 
exploration of these models is beyond the scope of this paper 

To gain some crude sense of how our measurements com- 
pare with theoretical models, we use the diagnostic fig. 3 in 
Lidz et al. (2006), which shows predictions for the quasar 
bias as a function of luminosity at z = 2. Their models use 
the halo bias formula from Sheth et al. (2001). The dynam- 
ical range in the median luminosity of each bin probed by 
our sample is quite narrow, ^ 0.4 dex, compared to 1 dex 
for the 2QZH-2SLAQ sample used in da Angela et al. (2008). 
For our bright and faint quasar samples as defined in ^2.11 
and in the upper-left panel of Fig. |2l the median redshift is 
z= 1.4, and the median luminosities are Zboi = lO"*^'^^ ergs"' 
and lO"*^-" ergs"'. Note thatLboi = 10"*^'^ ergs"' corresponds 
to rest-frame B-band luminosity Lg ^ lO'^L© in Lidz et al. 
(2006). Below and around this luminosity, even the light bulb 
model in their fig. 3 (without any scatter a) gives weak lu- 
minosity dependence of quasar clustering. This is consistent 
with our measurements, as well as da Angela et al. (2008), 
who probe only a < 1 dex dynamical range in luminosity at 
redshift below ^2.5. It is also marginally consistent with the 
results at z 2.5 in Adelberger & Steidel (2005), whose dy- 
namical range in AGN luminosity is ~ 2.5 dex. 

When the scatter ai in eqn. (|7]) is incorporated, the 
weighted bias factor at fixed luminosity will drop due to the 
scatter from more abundant less massive, and less biased ha- 
los, but it can be fine-tuned to yield an even flatter luminosity 
dependence if, for instance, the halo duty cycle /on(A^haio) de- 
creases at an appropriate rate when Mhaio decreases. One must 
also correct the overall linear bias of quasars by adjusting the 
normalization of the correlation between luminosity and halo 
mass (Eqn. |7]), using constraints from the observed quasar 
luminosity function and the theoretical halo mass function, 
following the logical flow we just formulated above. At this 
stage, the majority of the measurements of the luminosity de- 
pendence of quasar clustering for moderate luminosities are 
consistent with theoretical predictions of weak dependence, 
but are unable to distinguish different models. Uncertainties 
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in the quasar luminosity function, halo bias factor, and halo 
mass function, which are all needed to make a self-consistent 
model, further obscure the situation. 

Since the halo bias factor increases very rapidly with mass, 
various models are sensitive at the high-luminosity end of 
quasar clustering. Unfortunately, previous studies were based 
on samples containing too few bright quasars to perform cor- 
relation analyses; cross-correlating those bright quasars with 
much larger galaxy samples is a promising approach (e.g., 
Adelberger & Steidel 2005; Padmanabhan et al. 2008), but 
statistically significant results are yet to come. We have 
shown for the first time that the most luminous quasars in- 
deed are more strongly clustered than their fainter counter- 
parts. The median luminosity for our top 10% most luminous 
quasar sample is Zboi = lO'*'''*'* ergs"', a factor of ^ 0.5 dex 
larger than the median luminosity for the remaining quasars. 
With this increment in luminosity, we are probing the rarest 
and most biased halos. Using the Sheth et al. (2001) halo 
bias formula, we estimate a halo mass ~ 1 - 2 x lO'^* /i~'Mo 
for the 10% most luminous quasars and-2xlO'2/2-iMQfor 
the remainder. 

4.2. Virial BH Masses versus Host Halo Mass 

We may also expect a correlation between host halo mass 
and black hole mass. There have been claims for such a cor- 
relation at low redshift, where the circular velocity is used as 
a surrogate for halo mass and the central velocity dispersion 
is used as a surrogate for BH mass (e.g., Ferrarese 2002; Baes 
et al. 2003), although significant uncertainty remains due to 
limited sample size and systematic effects; for instance, it is 
argued that this correlation is dependent on Hubble type (e.g., 
Courteau et al. 2007; Ho 2007). But for now let us neglect 
the systematic effects, and, analogous to eqn. (|7]i, write the 
correlation between central BH mass and halo mass as: 

log Mbh =C + (3 logMhalo + 0-BH , (12) 

where the constants C and /3 define the normalization and 
scaling of this correlation, and the random Gaussian devi- 
ate ctbh denotes the scatter of this correlation. Neglecting 
the scatter ctbh and taking /3 ^ 1.65 as measured in Ferrarese 
(2002), one dex in Mbh corresponds to ^ 0.6 dex in Mhaio- 

The virial masses we measure for our quasar sample are 
scattered from the true masses by ~ 0.3-0.4 dex (see discus- 
sion in Shen et al. 2008b). To use eqn. (fT2l i and its inverse re- 
lation we need a model to disentangle the intrinsic BH masses 
from the estimated virial BH masses. 

Shen et al. (2008b) proposed a simple model of the intrin- 
sic BH mass and Eddington ratio distributions which simulta- 
neously reproduces the quasar luminosity and virial BH mass 
distributions in different luminosity bins for SDSS quasars. In 
this model, the intrinsic BH mass has a power law distribution, 
and the Eddington ratio has a log-normal distribution at fixed 
intrinsic BH mass with a dispersion 0.4 dex (see §§4.3 and 4.4 
in Shen et al. 2008b for details). We follow the methodology 
outlined in §4.3 in that paper and use the model parameters for 
the Mgii virial mass relation in their Table 2 to simulate both 
true and virial BH masses and bolometric luminosities. These 
parameters are found to be redshift-independent (Shen et al. 
2008b). We then use equation ([TJ to simulate the /-band abso- 
lute magnitudes and impose the SDSS magnitude cut (/ = 19. 1 
at 0.4 < z < 2.5) until we achieve the same redshift distribu- 
tion of observed virial BH masses as shown in Fig. |2] 

Our simulated virial and true BH masses are shown in Fig. 
[T3l where they are divided using the same definition of our 



high/low mass subsamples in ^2.11 The simulated virial BH 
masses reproduce the distribution of observed virial masses 
well. The differences in the median values between these two 
subsamples are 0.4 dex and 0.2 dex for the virial and true 
BH masses respectively, i.e., there is not much distinction be- 
tween the true masses for the apparently high and low mass 
populations. The dynamical range in the median Mhaio is then 
only ^ 0.12 dex given our assumed simple model without the 
scatter ctbh- Take the typical host halo mass 2 x 10'^ h~^MQ, 
the expected difference in bias for the low and high mass sam- 
ples would be only 5%, well below our la uncertainties 
10%. Thus it is not surprising that we do not observe an ap- 
preciable difference in the clustering strength of the high and 
low mass quasar samples, given our current precision of the 
clustering measurements. As in the luminosity dependence 
case, when the scatter ctbh is included, the expected depen- 
dence could be even weaker 

4.3. Color and Radio Dependences 

Our clustering measurements of reddened, red and blue 
quasar populations show consistent results, with differences 
that are undiscernible given our uncertainty levels. A weak 
dependence on color is expected if the color of a quasar is de- 
termined by the physical conditions in the host galaxy of the 
SMBH more than by its large-scale environment. Also, there 
is no strong correlation of color with either luminosity or BH 
mass, indicating that this diversity in quasar color occurs at all 
mass scales that our sample probes. These reddened quasars 
are most likely dust-reddened in the immediate nuclear envi- 
rons and/or host galaxy of the quasar, as argued by Richards 
et al. (2003) and Hopkins et al. (2004), and there is a strong 
connection between dust-reddening and broad absorption line 
quasars (BALQSOs; e.g., Reichard et al. 2003). Reassuringly, 
there seems to be no appreciable difference between the clus- 
tering strength of BALQSOs and regular quasars either (Shen 
et al. 2008a). 

The most significant difference in clustering strength we 
have found is for the FIRST-detected quasar sample. The rel- 
ative bias factor of radio-detected to radio-undetected quasars 
is ^ 1 .6, using the correlation length values in Table 1 . This 
implies that, on average, radio-detected quasars live in more 
massive halos than radio-undetected ones. Using the formula 
in Sheth et al. (2001) for the halo bias factor, the estimated 
host halo mass is - 1 - 2 x 10'^ /i"'Mq for FIRST-detected 
quasars, and - 1.5-3 x 10'^ /j-'Mq for FIRST-undetected 
quasars, at median redshift z ~ 1.4. If one uses the Jing (1998) 
fitting formula for the halo bias factor instead (which gives the 
largest bias factor for fixed halo mass compared with others), 
the resulting halo masses are ~ 5 x 10'^- 1 x lO'-^ h~^MQ 
and - 1 - 2 x 10'^ h'^MQ for FIRST-detected and undetected 
quasars respectively. Accurate determinations of the host halo 
masses for the radio-loud and radio-quiet populations again 
require better clustering measurements and robust halo bias 
factor from simulations. The trend observed in this study is 
consistent with low redshift (z < 0.3) observations by Man- 
delbaum et al. (2008) based on clustering measurements (also 
see Lin et al. 2008 and Wake et al. 2008) and lensing maps of 
radio-loud type 2 AGN. Both studies find that for comparable 
virial BH masses (or galaxy properties), radio quasars/ AGNs 
reside in more massive halos than their regular counterparts, 
indicating that radio quasars/ AGN may follow a different BH- 
halo relation from the normal population. 

5. CONCLUSIONS 
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We have performed clustering measurements of quasars 
from a uniformly selected sample in the fifth data release of 
SDSS, and focussed on the dependences of quasar clustering 
on luminosity, virial BH mass, quasar color and radio loud- 
ness. 

Our main findings are the following: 

(i) Strong luminosity dependence of quasar clustering at 
z < 2.5 is not detected for typical quasar luminosities. 
While this is consistent with several theoretical predic- 
tions, we caution that the dynamical range in luminosity 
probed in our sample is narrow and our current sample 
size (especially when split in luminosity /redshift) is not 
yet large enough to yield accurate clustering measure- 
ments. However, the 10% most luminous quasars in 
our sample are more strongly clustered, which is likely 
caused by the fact that halo bias increases rapidly with 
mass at the high mass end. The final data release of 
SDSS-II (DR7, Abazajian et al. 2008) will double the 
spectroscopic quasar sample size and boost the cluster- 
ing signal by a factor of ~ 4, thus providing superior 
results over the current study. 

(ii) Our measurements show weak or no dependence of 
quasar clustering on virial BH mass at z < 2.5. This 
is at least partly due to the fact that virial masses are 
scattered around true BH masses, where only the latter 
are presumably correlated with host halo masses. The 
limited dynamical range in BH mass and our limited 
sample size are other barriers to detecting this mass de- 
pendence. However, even if we have accurate BH mass 
estimates, the dependence of quasar clustering on BH 
mass may still be weak if there is substantial scatter 
around the relation between BH mass and halo mass. 
Thus the combination of precise BH mass estimates 
(i.e., improved virial or new BH mass estimators) and 
high-quality clustering measurements in future surveys 
can be used to test how SMBHs grow in dark matter 
halos in the hierarchial structure formation framework. 

(iii) Blue, red and reddened quasars have similar clustering 
strengths, at least indistinguishable at the current lev- 
els of measurement uncertainty and dynamical range 
probed by our sample. 

(iv) Radio-loud quasars are more strongly clustered than 
are radio-quiet quasars. This implies that radio-loud 
quasars live in more massive dark matter halos and 
denser environments than radio-quiet quasars, consis- 
tent with local z < 0.3 observations for radio-loud type 
2 AGN (Mandelbaum et al. 2008) and radio galaxies 
(Lin et al. 2008; Wake et al. 2008). Radio quasars clus- 
ter more strongly than a regular quasar sample matched 
in virial BH mass, indicating that the BH-halo connec- 
tion might be different for the radio population. Mas- 
sive dark matter halos may provide the necessary hot 
medium needed for radio activity. 

Future quasar clustering analyses should aim at both 
broader dynamical ranges and larger sample sizes. Faint 
quasar detections are the focus of many ongoing or planned 
wide-angle survey projects such as BOSS (Schlegel et al. 
2007) and Pan-STARRS (Kaiser et al. 2002), which will both 
broaden the dynamical range and enlarge the sample size. Be- 
cause the halo bias factor increases rapidly at the high-mass 



end, the clustering of the most luminous and most massive 
quasars will provide the best test of theoretical models (e.g., 
Lidz et al. 2006; Hopkins et al. 2007). To obtain accurate 
measurements of quasar clustering at the high luminosity end 
one needs a much larger control sample to cross correlate with 
them; this control sample could be either a low luminosity 
quasar sample or a high redshift galaxy sample. 
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12 3 4 5 

Redshift 

Fig. 1 . — Distributions of quasars in the redsliift-luminosity (left) and redshift- virial mass (right) planes, for our uniform quasar sample. The superposed grids 
indicate the subsamples we use to explore dependences of clustering strength on luminosity and virial BH mass. In each bin, the number of quasars is marked, 
and the red filled circle marks the location of the median value. Note that these grids avoid straddling the redshift deserts at z ~ 2.7 and z ~ 3.5 due to quasar 
selection inefficiency (see text). 
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Fig. 2. — Sample divisions at median and 10% highest luminosities (upper panels) and virial masses (bottom panels), in the redshift range 0.4 < z < 2.5. In 
the bottom of each panel we show the normalized redshift distributions for each subsample. The similarity of their redshift distributions allows a fair comparison 
of their relative clustering strength. 
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Fig. 3. — Sample divisions using quasar color excess (Richai'ds et al. 2003), in the redshift range 0.4 < z < 2.5 (see ^2.U . In the bottom panel we show the 
normalized redshift distributions for each subsamples. The similarity of their redshift distributions allows a fair comparison of their relative clustering strength. 
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Fig. 4. — FIRST detected and undetected quasars. Histograms show the distributions in redshift and optical luminosity for the two samples. The radio fraction 
as a function of redshift is shown as open symbols in the bottom panel. 
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Fig. 5. — Luminosity depende nce o f quasar clustering, using the bright/faint samples (top panels) or the top 10% luminosity divided samples (bottom panels), 
with 0.4 < z < 2.5 as defined in j|2.1| For the top 10% most luminous quasars we have cross-correlated them with the remaining 90% quasars. Solid and dashed 
hnes are the best-fit power-law models excluding and including negative data bins in the fits, respectively (see text for details). Left: redshift space correlation 
function, right: projected correlation function. No appreciable difference in the clustering strength is observed for the median luminosity division. But the most 
luminous quasars show appreciably stronger clustering than the rest of the quasars. 
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Fig. 6. — Correlation functions in each redshift-luminosity bin as defined in the left panel of Fig. [T] The redshift ranges are marked on the top of each column 
and the luminosity [Mi(z = 2)] ranges are marked on the right of each row. Left: redshift space congelation function. Right: projected correlation function. In each 
panel, the dashed line is a power law model of ^.((.v) = (s/6.5 /i^'Mpc)"'^ or §(r) = (r/6.5 /r'Mpc)"^, drawn to guide the eye. No appreciable difference is seen 
in each luminosity bin at fixed redshift, due to large error bars in the correlation functions caused by small sample sizes. 
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Fig. 7. — Virial ma ss de pendence of quasar clustering, using the high/low-mass samples (upper panels) or the top 10% virial mass divided samples (lower 
panels) as defined in j|2.1| Solid and dashed lines are the best-fit power-law models excluding and including negative data bins in the fits, respectively. Left: 
redshift space correlation. Right: projected correlation function. The high/low-mass samples are consistent with no difference in clustering strength given the 
measurement errors. However, the most massive quasars are more strongly clustered than are the rest of the quasars. 
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Fig. 8. — Correlation functions in each redsliift-virial mass bin as defined in tlie right panel of Fig. [T] luminosity bin as defined in the left panel of Fig. 
[T] The redshift ranges are marked on the top of each column and the virial mass (logMgn.vir/^Q) ranges are marked on the right of each row. Upper: 
redshift space correlation function. Bottom: projected correlation function. In each panel, the dashed Une is a power law model of = {s/6.5 /i^'Mpc)"' or 
^(r) = (r/6.5 /j^'Mpc)"^, drawn to guide the eye. No appreciable difference is observed at fixed redshift, due to the large uncertainties in the correlation functions. 
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Fig. 9. — Clustering of blue, red and reddened quasars, using the samples defined in j|2.1l Solid and dashed lines are the best-fit power-law models excluding 
and including negative data bins in the fits, respectively. Left panels: redshift space correlation functions. Right panels: projected correlation functions. In the 
upper two panels we show the auto-correlation functions for the three samples; there is no appreciable difference between the red and blue quasar samples. The 
auto-correlation function of the reddened sample is too noisy to be useful. In the bottom panels, we show the auto-correlation functions for the red-Fblue sample 
and the cross-correlation of the reddened sample with the red-Fblue sample. Again no discernible difference is observed given the uncertainty levels in the CF 
measurements. 
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Fig. 10. — Clustering results for the radio-loud and radio-quiet samples as defined in ^2.11 Solid and dashed lines ai'e the best-fit power-law models excluding 
and including negative data bins in the fits, respectively. Left: redshift space correlation function. Right: projected correlation function. The cross-correlation 
function of the radio-loud sample with the radio-quiet sample is larger than the auto correlation function of the latter sample, as shown in both panels. 
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Fig. 11. — Binned CPs for the two high-z luminosity bins. Left: redshift space congelation function. Right: projected correlation function. Despite the absence 
of detectable correlation for the fainter sample (triangles) at r < 20 A^'Mpc, which may be caused by systematic effects (see the text), there is no appreciable 
difference at large scales (r > 20 /i^'Mpc) for the two samples, although the error bars are large. 
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Fig. 12. — Measured quasar bias factors for various samples. The filled circles in the main frame are results for all quasars within each redshift bin (no 
magnitude cut) and results both with and without negative data points in the fitting procedure are shown. The solid and dotted Unes are halo bias factor evolution 
for fixed halo mass (from bottom to top) Mhaio = 5 X lO", 1 X lO'-, 2 X lO'^, 4 X lO'^, 8 X lO'^, 1.6 X lO'^ /j-'Mq using the Sheth et al. (2001) and Jing 
(1998) fitting formulae respectively. The Jing (1998) formula generally gives larger halo bias than the Sheth et al. (2001) formula for the same halo mass. In the 
inset we show the quasar biases for our 0.4 < z< 2.5 assembly samples divided by luminosity (filled circles; black for faint quasars, red for bright quasars, and 
green for the brightest quasars), virial mass {open squares; black for low-mass, red for high-mass and green for the most massive quasars), color (filled triangles, 
blue for blue quasars, magenta for red quasars and red for reddened quasars), and radio detection (circles with x; black for FIRST-undetected quasars and red 
for FIRST-detected quasars). Their median redshifts have been shuffled to avoid clustering in the plot. 
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Fig. 13. — Simulated virial (left) and true (middle) BH masses. In addition to the Malmquist-type bias between the virial and the trae BH masses (e.g., Shen et 
al. 2008b), the two samples divided using the median virial BH masses overiap substantially in their true masses. In the right panel we show the distributions of 
BH masses relative to the median values. The solid and dotted histograms show the distributions of deviation from the median value (which varies with redshift) 
for obsen'ed and simulated virial BH masses, which are in reasonable agreement. The dashed histograms show the distributions of simulated true BH masses 
of the two samples divided by the median virial BH masses. The median difference in the two samples are 0.4 dex in virial BH masses and 0.2 dex in true BH 



